42 research outputs found

Random Sampling of States in Dynamic Programming

Author: B.J. Stephens
C.G. Atkeson
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'

Crossref

Local dimensionality reduction

Author: Atkeson C.G.
Schaal S.
Vijayakumar S.
Publication date: 01/01/1998

Edinburgh Research Explorer

Using humanoid robots to study human behavior

Author: Atkeson C.G.
Hale J.G.
Kawato E.
Kawato M.
Kotosaka S.
Pollick F.E.
Riley M.
Schaal S.
Shibata T.
Tevatia G.
Ude A.
Vijayakumar S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2000

Our understanding of human behavior advances as our humanoid robotics work progresses, and vice versa. This team's work focuses on trajectory formation and planning, learning from demonstration, oculomotor control, and interactive behaviors. They are programming robotic behavior based on how we humans “program” behavior in (or train) each other.

CiteSeerX

Crossref

Enlighten

Probabilistic Inference for Fast Learning in Control

Author: A. Girard
C.E. Rasmussen
C.G. Atkeson
E. Snelson
J. Peters
K. Doya
R.S. Sutton
S. Schaal
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008

We provide a novel framework for very fast model-based reinforcement learning in continuous state and action spaces. The framework requires probabilistic models that explicitly characterize their levels of confidence. Within this framework, we use flexible, non-parametric models to describe the world based on previously collected experience. We demonstrate learning on the cart-pole problem in a setting where we provide very limited prior knowledge about the task. Learning progresses rapidly, and a good policy is found after only a handful of iterations.

Crossref

Spiral - Imperial College Digital Repository

MPG.PuRe

On Optimizing Locally Linear Nearest Neighbour Reconstructions Using Prototype Reduction Schemes

Author: B.V. Dasarathy
C.G. Atkeson
C.J.C. Burges
C.L. Chang
G.L. Ritter
I. Tomek
J.C. Bezdek
K. Fukunaga
P. Kang
P.E. Hart
S.-W. Kim
S.T. Roweis
Publication venue: Springer Berlin / Heidelberg
Publication date: 01/01/2010

This paper concerns the use of Prototype Reduction Schemes (PRS) to optimize the computations involved in typical k-Nearest Neighbor (k-NN) rules. These rules have been used successfully for decades in statistical Pattern Recognition (PR), and have numerous applications because of their known error bounds. For a given data point of unknown identity, the k-NN rule combines the information about the a priori target classes (values) of selected neighbors to, for example, predict the target class of the tested sample. Recently, an implementation of the k-NN, named Locally Linear Reconstruction (LLR) [11], has been proposed. Its salient feature is that, by invoking a quadratic optimization process, it can systematically set model parameters such as the number of neighbors (specified by the parameter k) and the weights. However, the LLR takes more time than conventional methods when applied to classification tasks. To overcome this problem, we propose a strategy of using a PRS to solve the optimization problem efficiently. In this paper, we demonstrate, first of all, that by completely discarding the points not included by the PRS, we obtain a reduced set of sample points with which the quadratic optimization problem can be solved far more expediently. The values of the corresponding indices are comparable to those obtained with the original training set (i.e., the one which considers all the data points) even though the computations required to obtain the prototypes and the corresponding classification accuracies are noticeably less. The proposed method has been tested on artificial and real-life data sets; the results obtained are very promising, and the method has potential in PR applications.

Crossref

Carleton University's Institutional Repository

NORA - Norwegian Open Research Archives

Agder University Research Archive

Adaptive Optimal Feedback Control with Learned Internal Dynamics Models

Author: C.G. Atkeson
D.H. Jacobson
D.P. Bertsekas
E. Todorov
M. Grebenstein
M. Katayama
N. Özkaya
P. Dyer
P.I. Corke
R. Shadmehr
R.F. Stengel
S. Klanke
S. Schaal
S. Vijayakumar
T. Flash
W. Li
Y. Uno
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010

Optimal Feedback Control (OFC) has been proposed as an attractive movement generation strategy in goal-reaching tasks for anthropomorphic manipulator systems. Recent developments, such as the Iterative Linear Quadratic Gaussian (ILQG) algorithm, have focused on the case of non-linear, but still analytically available, dynamics. For realistic control systems, however, the dynamics may often be unknown, difficult to estimate, or subject to frequent systematic changes. In this chapter, we combine the ILQG framework with learning the forward dynamics for simulated arms, which exhibit large redundancies both in kinematics and in actuation. We demonstrate how our approach can compensate for complex dynamic perturbations in an online fashion. The specific adaptive framework introduced lends itself to a computationally more efficient implementation of the ILQG optimisation without sacrificing control accuracy, allowing the method to scale to large DoF systems.

CiteSeerX

Crossref

Edinburgh Research Archive

Differential Effects of Target Height and Width on 2D Pointing Movement Duration and Kinematics

Author: Adam J.J.
Atkeson C.G.
Darling W.G.
Fitts P.M.
Gordon J.
Hoffman E.R.
Howarth C.I.
Ketcham C.J.
Lajoie J.M.
Langolf G.D.
MacKenzie C.L.
Messier J.
Meyer D.E.
Morasso P.
Murata
Rand M.K.
Rosenbaum D.A.
Soechting J.F.
Teasdale N.
Publication venue: 'Human Kinetics'

Crossref

Trajectory-Based Dynamic Programming

Author: A. Altamimi
C.G. Atkeson
D.H. Jacobson
E. Frazzoli
J.A. Boyan
J.D. Schierman
J.J. Murray
J.N. Tsitsiklis
P. Dyer
R.L. Larson
R.S. Sutton
S. Ramamoorthy
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

Crossref

You did it on purpose! Towards intentional embodied agents

Author: A.M. Meltzoff
B. de Boer
B. Jansen
B.F. Malle
C.G. Atkeson
G. Butterworth
Publication venue: SPRINGER-VERLAG BERLIN
Publication date: 01/01/2004

The paper describes a roadmap towards intentional behavior in artificial systems. We capture the developmental path in two dimensions: a social dimension and an intentional dimension. Starting with a babbling phase, development continues through an exploratory phase without social interactions and a phase in which action-level imitation is used. The pinnacle of development is the intentional imitation of goals. An experiment, together with preliminary results, is presented for each developmental phase.

Crossref

Ghent University Academic Bibliography